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Abstract 



r 



As data on the reliability and validity of ratings of infant .temperament 

■ * , ..." . ■. ■ ■ 6. *■'.■.. 

have accumulated, researchers have begun to question what caregiver ratings * 

really measure. Cairns and Green (1979) have argued that ratings of social 

■ ; . •. •. ' ; ■ . ■ • , \ ■ ' '■ 9 

'behavior are less a reflection of enduring individual differences than a measure 
of rater characteristics and error variance. This .sturdy investigated these 
extraneous- sources of variance in Carey 1 s Infant Temperament Questionnaire (ITQ). 

A large, diverse sample of mothers' completed ITQs on their 6-month-olds. " \ '■ .: 

: Y ■' ■ < ■ • . ' V ' - ' ' •'• ' . 

Item analyses first identified a cluster of items that were left blank by * a 

select, middle class group of mothers. Next, a large number of ite^s (40%") 

were found to r exhibit marked response biases. Finally, parental characteristics 

were shown to affect response biases and styles, Implying that systematic biases 

are built into- the scale. Temperament ratings also^varied with social class and 

ethnicity as well as knowledge of infant development. Infants judged as easier 

tended to have mothers who were white, middle class, and more knowledgeable. 

' . - - ■ * .* . . • 

* ■% • 

These results suggest that caregiver ratings on the ITQ reflect more than mere 

~ : ' .1 . . • ■ .* ■ ■ ♦ . . . J 

stylistsic differences in infant behavior. ^ 
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A major issue in the • study of infant t'emperament concerns ^the extent to 



which caregiver ratings accurately portray infant behavior. Confidence in the 
validity of ''such ratings has been eroded by 'recent f indings from two areas of , . , . 

. research. First, studies of motherrf ajther and -parent -observer agreement typically 
find correlations on the order of .20 to .50 4 (Bate 4 s, Freelarid & Lbunsbury, 1979; \ / 

. Wilson & Matheny,* 1983) . v Given that less than 25% of the variance is being • ^ 
explained, this suggests that something other than stable individual differences 
is* being a^s sussed. Second, attempts to document relationships between parental ^ 
attributes Ce.g., SES; race; altitudes; personality traits) and rVtings of infant 
temper&ment usually are ^jfcite successful* (Bates, et ^al^Vvl.979; Crockenberg & 

«■ n " » . • . ; - • ■ - 9 . ~ • - " 

Credolo, 1983; Siifiorids, & 'Simonds, 1982; for reviews, ^ee Bates, 1980, and Hubert* ; 
Wachs, Peters-Martin & Gandour, 1982). In "fact, parental .attributes have been 
found to Ije more predictive of v^emperament ratings than infant characteristics 
(Sameroff , Seifer & Elias, 1982) , raising the possibility |hat*ratihgs 
be projections of ; the parent onto the rating scale. The lack of validity and 

fnterrater reliability has been variously attributed to flaws in the instruments: 

. w • . ■»•-'■■ '•'*•• - . ■ ^ 

to differences in familiarity with t;he target child;.. to behavior that is person- 
/or' situation-specific; ahd to the influence of subjective perceptidns by the, parent. 
, - Taking a broader view of ratings of social behavior clarifies the sources of 
variance in temperament ratings. The analysis by Cairns and Green (1979) of y 
ratings versus qbservatibijs ^of social behavior (see figure 1) may not* change some 
researchers 1 antipathy toward ratings, but at least the reasons for any pessimism 

are apparent. Thes'e authors arguie that while the goal of ratings is to capture 

\ . , If----'--- *' .'. • • . ' • • 

enduring individual tra,its, most of the variance actually i^ due to characteristics, 

of the rater or to error variance. Rather chafact- eristics include idiosyncratic * 

interpretations 6f the cbnstfruct, knowledge of the individual child and tKe refer- * 

. ■ - , ■ 4 ' ■■' . 1 ; " , « • ^ ~ 

ence population, .and scaling of the individual onto the group distribution. 

/ - ■' . ^ . ; , \ ■ ^ • % . ■ •■ . 
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Parental expectations atiud beliefs also taay iaediate temperament ratings, and would 
be most evident when looking at the effects of parity (Goodnow, in press) or 

' . \ ; ^ ■.•.•v ; ' : ,' *: . ; . , : . . ■. 

subcultural* differences, . In k contrast, observational techniques primarily reflect 

■•••••..•'*• o. ' •* • • . ' .-" .•• -* , ' -v. ... 

tlie influences of the setting and s t\ie dyad. Thus, strong correlations between 

parent ratings of temperament and 'observational measures would not be expected 

because^ the two' techniques serve different purposes. Furthermore, twc^ raters 

might differ in their familiarity with the target infant and infants in general; 

in .their interpretation of /tfie scale items and metric; and in their expectations/ ^ 

and beliefs. All of these will conspire jLo depress interrupter r'eliabi%Lty*l Such 

an analysis of the task confronting raters calls into v question the validity of 

infant temperament ratings -as pute measures of infant behavior. ; 3 : 
. • » - >. ,"" v • ' W ... j \: J ,V . 

This study is particularly concerned with two extraneous sources of variance 

mentioned by Cairns and* Gyeen. - J^st^if "instrum^lDt f allure? 1 is the primary ' 

culprit for the ppor reliability aii<f valid it y^f temper amen t scalefe, 3 as^Hubert,. 

et al. (1982) concluded, thjen tfels area .of inquiry could benef it from an .analysis 

of the error variance thaferef lects^fJoor methodology. That is, a consideration 6f 

response biases and styles, problems 'of scaling, ^nd item selection might illuminate 

techniques for reducing error variance, increasing our confidence that we indeed 

are measuring infant temperament. Among some of the tef f ects- contributing to error ■ 

variance arfe response styles and biases. A responspf style is /an- individual 

• ■• },.'"'■■ ' ■ - ' ' : '"■ -A-/, .'/'• v *7 •. •;>■:■" „ ' 

difference "which is an art if actual product of the measurement methods and is .at 

V X .. . • ' v .' r ; ; J/-J< /"::;.?■■ • ' 

least partially' independent of th$ trait which the n^asuremepl: methods are 



intended to measure 11 (Nunnally^ 1967, p. 593). On a temperament scale with 6 



options, for instance,' some parents may select only the extreme choices while 
others may consistently choose the middle two. A response bias, in turn; is a 
group measurement ajrtif -^ct, such as choosing" th^, "Almost /always 11 ^option. y 

x . : *• ./ "■: •' w. ; 

*■* * • / 

• ■ •' ... ' •;. • - v 
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Finally, biases may be introduced through the ^ selection ojf items that do riot apply 
to'a select group of subjects, or items that are; highly ? correlated yith variables 

unrelated to temperament. All- of these extraneous factors become a property of the 

■ ■ ' ^ v •* •• •:'* '. * ' 9 ■ ' ' ■' -'.'"V." .-. 
; rating scale during its construction and inflate the error variance' to a greater 

.... . . , -;*.■■ ,o„ ./. -." \ • ■;.',»..■ ,'.■>' ■ ' 

or lesser 'extent ? '7 '* - " ■ ' \. • , •' r ■■■■ •■ 

! ' •:. ' 7 9 j - ; . ' >' •- ■ ; "■ / ■ . •■ . ; ■ ; . '■ . • •• ' ". v • oV • . 

■ V , The second Source of variance "this study.dnvestigates is rater characteristics, 
especially those that might affect the "anchor". Anchoring, effects, which pertain,, 
to th^'origin and unit of - t;he\ subjectiv^continuum" ^Torgerpdn, 1958, p. 79) the 
parent uses in rating the infant, , will arise from the individual 1 s experiences' tli4t 
shape ea^^tatiohs "or perceptions of what is typical and atypical. . These expetta^ 
t ions then^ay be/ used as a reference point; for deciding where the targSB infant ^ ; V : 



In 



should" be plaqed on the item. \ Parents who are well-versed in infant Behavior arid V 

•■ ' '•■ '. : * ' ' ■ * • - ! ' '" 

development might have a ^different anchor — -and so rate their infants differently— 

than parents who have^been exposed to only a few infants otherv than thei^ qwn. , : - 

Similarly,; cultural differences in Expectations may; affect rat " 

sum, , Cairns and Grefen's (1979^) analysis of sources-^ ^variance in ratings 

of social behavior Is applied to ?arey and McDevitt's (1978) Infant Temperament 

Questionnaire (ITQ) . s The first iss^ie to be addressed^ xid.ll be properties pf t^ie ITQ 

that might- contribute to error variance, such as response styles, ^espdnse, biases, 

and biases due to item selection* The second area of investigation eotflfcerns the^ 

effects of parental expectations on temperament ratings. These expectancies for , 

, w 5 ; . " ' '. ' - . /V" ' '/ ' . ;/ ° '«'"..■ '■' ^ • ■ -.' 

infant behavior— hypothesized to vaify,.with social class, ethnicity" and ? knowledge . 

of infant development—are assumed to mediate temperament ratings. v 

s - ' • ' ' ■ , , • , ,: 4 

• v. • Method-' ' ■ " „, /// ■ '* •. , 

-. • " J . * '. • ' ' ; ^ ,r v • • 

, Subjects : . J ' : " ' ;< .. ? '. 

. • . ; , * ■ • * ' . ' 1 ■ •: . # • ; ■ ■ . y': • ' . 

Mothers o€ 6-month-old infants were selected from the birth ^records of a 

. * • / ' J( . • • - u ■'• : . •■'^ ' ( - 

regional referral hospital 'in Chapel iH^i, subject to the following criteria:, ■} 



ERIC 



(1) no* life-threatening complications during pregnancy and delivery; (2) birth- 

. c '■' ... ■ . ■■■ . ■ 9 \ ■ . * 

weight greater than 2700 grams; and (3) { 5-minuteApgar l greater than 5. Of the *: 

434 potential subjects^ 26% could not be contacted because they did not have a % • 

telephone or they had moved from the area. • Four percent of those contacted 

declined to participate in the study, andean additional 16% failed to return the . 

questionnaires, leaving a sample of 256 * subjects,. The sample represents a diverse- 

— ^bup of mothers fn terms of educatigrf (X . ■ 13.5 yrs.; range * 7-^2); age (X = • 

\ 26,2 yrs,; range =16-43); race (60a^jw*iite) ; parity (X -1.66 children; range f' 

l<-6); and social class (Duncan SEI X = 37.6; range = 0-96). 

Procedure V * 

- All/mothers, who.^greed to participate were mailed the Knowledge of. Infant 

. . % ' . '* • . ■ " , ■*•''. i ■ 

Development Inventory (MacPhee, Note 1) ,^ a- 75-^item questionnaire* €hat assesses - ' 
^familiarity with infant norms and milestones; principles of growth>a)ad development*; ■:; 
parenting strategies and skills; and hAk&th c^re 4 and safety issues. In addition, 



196 of the subjects completed the ITQ. The I k TQ* contains 95 items in 9 categories. 
Subjects rate the infant 1 s behavior on a 6-point scale ranging- from "Almost never 11 
to Almost always! 1 . An &|ditio 0 nal series of questions solicits the, rater 's^ 
general impressions of the infant's behavior^ on „each of the 9 temperament " d^ensions. 
The completed f orms were^ireturned in . envelopes supplied to each mother. *r 
" Y " ■ ' ' •••v * • Results . • • , ' 'V ' , ■' " ■ ., ■ * 

Response Biases and StjgLeis . - / fc * ° & 

n Biases .may be feuilt.into^ a rating scale by including items with skewed 

"■" ....... ' . ' • • ' ■ ' 

frequency distributions (due to anchoring effects), or by choosing items that are ' 
not relevant to a ^select group of parents or infants. Items that are often left * 



tblank dh the ITQ provide an illustration of .this latter type of bias. Item 
5 analystffe identified a* cluster of 11 items that were no^ Answered by at least 10% 
# of the sample; these items accounted for 55% of the missing data on the ITQ (see 
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Apperfdix A). MSrst of them (6) pertained to^adkpt&bility and approach to | new 
people, procedures , ' of settings. Five items cohcerned bottle or solid feedings, 
experiences that may pot apply to most * breastfed 6-month-olds — as mothers 1 comments 
the items implied. Finally, the correlation: between social class and the 




ber pf items left blank (r = .35,vp < .0001) suggests that much of the missing 
data on the ITQ can be attributed to events not; encountered by a select,' usually 



middle class groiap of infants. ^ , v ' , 

Biases due to anchoring effects will be observed on items where^ one Response 
Option is chosen with disproportionate frequency. On the ITQ, for example, each 
of the- 6 options would be chosen 16.67^ of {the time under a random model. In 
fact, however, 38 of the 95 items had k an optkon, that was chosen' a t greater than * 
twice 'the expected frequency. In general, mothers had a predeliction for the 
extremes, especially the "Almost always 11 option. ^ As Figure 2 demonstrates,- 
^rev^r^ing the items^ merely transplants the. response set from one extreme to the 
other, with only a modest reduction iii bias. Finally, 12 items v exhibiting * 
markedly skewed distributions* were .examined (see App.endix B) . It becomes apparent 
upon reading the i£ems that they describe some general properties, 'of infants 
rather than behaviots that capture individual differences' in temperament.^ °* 
Evidently, most mothers believe ^that infants are active - and Idist.ractible, which 
places the "anchor 11 for these, items at the extreme. i ; 

tHfe^particular strategy or respons^ style\the rater uses also will^ introduce 
rater variance that is unrelated to the characteristics of the infant being rated. 



Mothers, like many raters, appear to choose either the extreme or middle options. 
As Figure 3* illustrates, mothers who chose "Almost never 11 CN1) were also likely 
to select "Almost always" (N6) , but not options 3 br 4 which, in turn-, were 
correlated , with each o'ther. VMiddle class motljers . tended to be more Conservative 
in that they were, more lively to use option 3 (r. = -34, £ ^ .0001 for SES and N3) 



while lower "class mothers showed" a greater preference for "Almost always" (r_ 
-^29, £<.0001 for SES and N6). Again, attributes? and strategies of the rater 

' i - • ' ' • ■ ' . ■ ' , / ' X "'. ' ;. .' * «. . • 

are found to influence ratings of infant temperament* 

If response bias on the TTQ' is jointly determined by the inclusion of items^ 
wi^n skewed distributions and by some raters who rigidly select only one or two 



bptions,* perhaps this source of variance can be reduced by eliminating certain 
items, and subjects fronj the analyses. Accordingly, \the dozen items in Appendix 



B and 14 subjects who chose one option at least 50% of the tiirfe were dropped. 
Now when- the frequency of choosing each option is examined, the largest deviation , 
from the expected rate is* 3-. 5%, as compared to 8.9% in tlie original raw data., ^ 
Ratings of infant characteristics, on the JETQ thus might beWre valid if error and v 
rate?r variance' were reduced. 




Maternal Characteristics and Temperamep4srlRatings \ 
— r 7T~ ' ■ \ 

[ Because several' of the -ITQ ca£egory^scores are intercorrelated, t the first 

step 3^ the next stage 'of analyses was to f actdr aiialyzd the data; Four of the 

five ITQ categories on the first factor- (see Table 1) are used in calculating the 

' ./ " ' °t ' j# ' \ V 

infant's Easy/Difficult classification. While £h±3 lands' some validity to the 

..' " ' ' v ' . " ; . i , ■ y " . . 

»• procedure, it also suggests a more parsimonious' scheme for^ summarizing the data— 

• * ^ ° \ „ ' >' ■ \ 

one using £ continuous rither than discrete "variable. Factor 2 is consistent 

with Buss and Plbmin f s ( (1975) discussion of activity level in fcenj!^ of * vigor and . p 

tempo. Finally, an infant with a high &core on Factor 3 could be described as 

■>' «. 9 

pathetic or phlegmatic: low in»persistence and relatively unresponsive to 
stimuli. ■ ■ / ■ _ . % . 

■■/ The factor scores then were used as the dependent variables in a 2« (SES) X 
2 (Race) MANOVA. A marginal main effect for social class and a strong effect 
* due to race were observed on the .Easy/Difficult factor (see. Figure 4), replicating 
Sameroff , et al. f s (1982) results with an earlier version of the ITQ. No differences 



/ 
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T emerged on Factor 2, but, on Factor 3, lower SES infants regardless of race were 

«• - * ^ ' 

rated as more phlegmatic. As in the Samerofif, et al. (19§2) study, the latter 

' ' - ■ \ . . • " • 

effect seems to be carried more "by scores on Threshold (i: ■ -.21, £4.j005 with 

SES), rather than Persistence (£ j= .10, £ =» ,15 with SES). Thus, Ifcwer SES 
mothers rated their .infants as being ^somewhat more apathetic and difficulty 
Differences became particularly evident m when examining the relationship between 
race, and the Easy/Difficult factor. These effects cannot be explained by a 
confounding of race and soci^L class because .there were enough middle SES- Blacks 
in this sample (Nf16) to explore the"" independent contributions of the two 
variables to temperament ratings. Whether these racial differences 'fire attri- 
butable to constitutional differences in infant temperament/or to cultural 

influences on parental interpretations of infant behavior is not cl6ar but, 

'.' ' ••<.'. \ ■ * ' i ' . \ . ' ; 

in light of contemporary interact ionist positions ' (Freedman, 1974; Bates, 1980), 

' • . ... " '/>•'• ■ ■ " ." '" 

'the issue may be moot. , * 

" . < ■ ; v . 

^ ' Cairns anc\*£reen (19(79) have argued that one task confronting a tktex is to 
interpret the target infant's ^attributes within the framework of a reference ^ 
population. That is, the rater must be familiar with the group distribution-- 

or nQrms — in order to scale the indivi^yal onto it. Parents 1 knowledge of infant 

» ■ . ■ ■ 

development, therefore,' should.be related their ratings *6t infant temperament. 

Correlations between scores on the knowledge scale and ratings on the ITQ partially 



j 

r confirm this hypothesis (see Table 2). Mothers having more accurate knowledge 
'rated their infants as being easier in 'temperament . Neither method covariance— 
a mother's skill on papqr-and-pencil tests — nor the process described by Cairns 
and Green, though, can explain the lack of significant results on Factors 2 and 
3. It is* plausible, therefore, that more knowledgeable mothers are more skilled - 
at maintaining their infants in a eas£, contented state; or these mothers may be 
more confident and relaxed with their babies and so rate them as being easier. 

"4 • ' 
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The latter interpretation is supported by Krons tad t, Oberklaid, Ferb, and Swartz 
(1979) and'by Sameroff , et al. (1982), who found correlations of similar magnitude 
between maternal anxiety and ratings of the* infant as difficult on four of the 

. v>> ■ .<* " 

same ITQ categories (Approach; Adaptability; Moqd; and Rhythmicity) . ; . 

An indication of how strongly maternal characteristics contribute to ratings 
of infant temperament can be gleaned from 3 multiple regressions, usinf| SES, race, 
and knowledge 'scores to predict the ITQ factor scores. For the Easy/Difficult 
factor, R 2 .24 [F(3,190) ■ 9. 96, , .0001] with all three predictors making 
signific&nt contributions, especially knowledge. The equation for Factor 2 was 
non-significant but for Factor 3, R 2 «. .08 [F(3,190) -^70, £^.02] with SES 
making the only significant contribution. Rater characteristics thus d£ influence 
ratings of infant temperament to a modest but significant degree. 
' ■ ■ Discussion 
In conclusion, what do ratings*, of infant temperament really measure? Several 
different answers to this rhetorically question were suggested by Cairns and Green 
(1979), each with different implications for how future research and instrument 

^development should proceed. First, parental ratings may faithfully record the 
infant's temperament, measured across time and settings. .Previous research (see 

.Bates, 1980) and the results of this study, though, suggest that such ratings ^ 
are an amalgamation of parent and infant characteristics; an assessment of 

v . ^ ' 

parental int erpr e tat ions of behavior. Thus, a second source of variance will be 
the skills, biases and expectations of the rater. Problems with interrater 
reliability then should lead to questions about the processes involved in rating 

infants that might --cause different raters to reach different conclusions. In 

* • * ■ 

this study, expectations about infants in general — the reference point or "anchor"-- 
were hypothesized to\influence the placement of the individual child on a subjective 
continuum. The correlations between mothers 1 knowledge of infant development and 
ratings of infant, temperament partially support this .thesis* - ■ 



The Contribution of rater variance to temperament ratings highlights an 
issue that has been the' cause of headed debate. in some circles: how to capitalize 
oh the parent's vast experience wi£h the infant while ensuring som6 measure of 
accuracy in reporting the ipfant's behavior (see Carey, 1981) > The present; study 
hints at several techniques that might minimize the effects of rater variance. 

First, training parents as raters would result in some uniformity if a way could 

- . » 

■ * • • ■ / 

be found to provide similar anchors and units of -scaling. This might be 

■ ' ' % . ■ ' ' -V . V" . 

accomplished by providing each parent with vignettes of the prototypic "difficult 11 

• • . • » ■ , 

and "easy 11 infant. Or, more complete descriptions 'of each response option could 

/ 

be written. Second, raters, whether mothers and fathers or parents and observers, , 

differ in their familiarity with the target infant and infants in general/ 

Covarying scores on^some assessment of expectations or perceptions of the average 

infant thus may enhance interrater reliability. In 'either case, the contribution 

of parental characteristics^, to temperament ratings will remain an obstacle Airitil. 

standardization of - raters is achieved. 

Another reply to what temperament ratings really measure involves f laws \ in , 

the instrument, an area of inquiry that has been relatively neglected. "Instru- ' 

ment failure 11 , to use Hubert, et.al.'s (1982) term, fundamentally is a problem 

of item wording and selection, and the choice of response options. The clustering 

of items into categories or scales — guided either by theory and/or empirical 

. ■, c > . - 

relationships — also would be considered under this rubric. On the ITQ, fo^r— - 

' ' • v : \ ; . .:• •'- . ' 

example, faulty selection led to the inclusion of "some very skewed items that 

are descriptors „of general infant characteristics • The pattern of missing data— 

again, a product of item selection — also introduces extraneous variance into the 

scale., Faqtor analyses of .the ITQ category scores in this study an<^jLr^revious 

ones (e.g., Sameroff, et al., 1982) suggest a more parsimonious and possibly more 

valid method for reducing the data". Finally, the response bias toward the 
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* . . A • •' ' ;\v '."v.-, ; , '•••''.*.> *'"• ••'.•■•.!■ . . .7 i . 

■ , ■■ ' - ■■ ""Xv \ ■ \ • ' • • . : ■ - . ■•- • .*. • : • * '•• ■•■ ■ 

• - v > . . / .7 : • . : .' : :'.y 

"Almost always 11 option, ajid the twd response styles of choosing either the extreme 
or- middle options, again raises „the problematic general issue of whether such 
ratingsVare an accurate measure of a "difficult 11 or "easy" infant, or a flaw in 
the instrument. * It is impossible to decide post hoc which explanation is correct 
without. ^spme independent Assessment of the infant. Therefore, the validity of 
the ITQ as a. measure of infant behavior remains to be- verified. 1 . 

• - - ■- ■ ■ . ' >' 

_ . . • ■ v * ' « - •«»•":•<* ' 

There are several potential techniques for minimizing error variance in 
temperament ratings. First,- disposing of extreme responses by deleting subjects 
who select § given alternative more than 50% of the time, as in this stddy^may 
decrease the skewed response prof ile in Figure 2 but at the expense of throwing 
out the baby with the bathwater. A more defensible approach would be to eliminate 
items with skewed frequency distributions or a high probability of being left 
blank. Or, a validity scale similar^ to the L scale of the MMPI could be dd^eloped 
to identify raters who fail to respond to the range of options. One also could 
use interrater correlations as a criterion in selecting items,, keeping those that 
have high parent-observer or mother -father agreement. This would provide some 
•■" assurance that temperament ratings are more a product of individual differences 
: \ in behavior across settings and raters, rather than error or rater variance. 
Whatever tactic is adopted, though, it is clear that revisions of the instrument 
are In order, 

The results of this study reinforce the conclusion of Hubert, et al. (1982) 
v that the issue of the validity of temperament ratings ultimately is entangled 
with psychometric concerns. That is, the presence of biases due to item selection 
response styles, and a consistent pattern of relationships with rater character- 
istics imply that ratings of infant temperament also are measuring error /and . 
rater variance. We will have valid measures of infant temperament only when • 
these extraneous influences are minimized. 
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Table 1 



Factor Analysis of the ITQ Categories * 



ITQ Factor 
II 



III 



Easy/Difficult 
Approach* 
Adaptability* 
Mood* 
Distractibility 
Rhythmicity* 

Activity/Reactivity 
Activity Level 
Intepsity* 

Phlegmatic 

Persistence 

Threshold 



.86 
.84 

• 75 
.61 

^57 



.80 
.77 



.74 
-.63 



* Category is used in calculating the infant f s Easy/Difficult 
classification. 



Table 2 ' . 

Knowledge of Infant Development and s * 
Ratings 'of Infant ^raperament^ 



■ ' ■ Accuracy on - 

Carey, ITQ the Knowledge Scale 



Easy/Difficult 

Approach -v-^ -.31* 

' ' ' V 

Adaptability / -.40* 

Mood \ t-29* 

Rhythmicity. ,.• % ^ -.30*' 

Distractibility -.25* 

Activity/Reactivity 

Activity Kfevel „ .11 . 

Intensity -.06 

Phlegmatic , 

Persistence -;03 
Threshold — , -.12 

* 2 < .0002 

Lower temperament scores are achieved by infants who are 
rated as easier, . less ^ctive and intense, and leSs 
phlegmatic. 



APPENDIX A 

, V. 

Items Frequently Left Blank on the ITQ 

: $ 



l\em No.^ 


Category 


Item 


: 

* • 4 

19 


y 

Adaptability 


Resists changers in feeding schedule. 


\ 31 


Add roach 


Initially rejects a new babysitter. j 


34 

» 

• 


Adaptability 


Objects to being bathed by a* nert person; in a 
new place. 


35 


Rhythmicity 


Amount of milk taken is unpredictable. 


56 


Adaptability 


Adjusts easily to^changes in place or time* for 
sleep. 


59 


bistractibility 


Can be calmed if fussing about a soiled diaper 


62 


Approach " 


Accepts changes in place (^r time for bath. 


68 , 


ETstrS&tit^^ity 


Rejects disliked food or medicine. 


76 - 


Mood 


^^"Fussy or moody when has a ^eold or virus. 

Rejects disliked food even |Lf mixed with a 
preferred one. J , 


83 


Threshold 


" 94 

* 


Adaptability 


Accepts changes in solid food feedings. 
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^ APPENDIX B. ^ 
Itemo Exhibiting Marked ReapiSnne Bias on the^Q* 



Option choaen* s 
Item No. and Category 



Item 




71 



-87- 



95 



6 - 72% 



6 - «51% 



6 - 81% 



_& 54% 



6 - 58% 



Adaptability 

Activity 

Intensity 
Distractibility 
Rhythmicity ■ 

Activity 
Adaptability 

Activity 
Distractibility 

Activity 

.Distractibility 

Activity 



Accepts, bath at any time of 'day without 
resisting." 

Moves about rntfch during diapering and 
dressing. 

Resists additional food or milk when full. 

Stops play and watches when someone walks by. 

Gets sleepy at about the same time each 
evening . 

Moves much when lying awake in ,crib. 

Objects to being bathed by a new person 
or in a new place. 

Plays actively with^parents. 

■> * 

Watches another toy even though already 
* holding one. 

Actively graspi or touches objects within' 
reach. 

Stops suctcing and looks when hears an 
unusual noise. 

Moves much when playing by self. 



* The parent is* given 6 options for each item, ranging from 1 (Almost never) to 

6 (Almost always). % 



